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Abstract 

Various parametrizations of ct'^^ are tried out on the small-x fixed target and HERA data. A two- 
Pomeron type parametrization is found to give the best reproduction of the data. The data indicate 
that the value of AeS for parametrizations of the form a'^^\Q2 oc approaches a constant value 
at high . The extrapolated values of a'^^ to very long coherence lengths are found to cross in 
some parametrizations for I > 10^ fm, indicating the possibility that a'^^ becomes independent 
at large values of the coherence length I. 



1 Introduction 



The small-x behavior of the proton structure function F2 is striking and has inspired many models and 
parametrizations. In this paper, fits using different parametrizations of the photon-proton cross section 
are compared. The standard Hand convention [1] is used to define the photon flux, yielding the relation: 

- x) p 

where a is the fine structure constant and Mp is the proton mass. Given a parametrization for a'^^, we 
can compare predicted values of F2' to data. Data from E665, NMC, HI and ZEUS have been used for 
these fits. 

The behavior of a^P is studied in the proton rest frame in terms of the coherence length of the 
photon fluctuations, I, and the virtuality, Q^. The physics picture is given in Fig.[T] where the electron 
acts as a source of photons, which in turn acts as a source of quarks, antiquarks and gluons. The partonic 
wavef unction of the photon state is dependent on / and Q^. The proton is viewed as a set of interaction 
centers for the incoming partons. 




Figure 1: Photon fluctuations scattering on the proton in the proton rest frame. 



At small-x, the coherence length of the photon fluctuations [2] in the proton rest frame is a more 
intuitive variable than Bjorken-x, and we will use this variable and Q"^ in our parametrizations. Recall 
the definition of the coherence length, /: 

I 



AE 

where AE, the change in energy of the photon as it fluctuates into a system of quarks and gluons, is 
given by 

AE ^ (1) 

V 

where we have set the mass of the hadronic system into which the photon fluctuated, M = Q, and v is 
the photon energy. This yields 

^ 2xMp 

Note that M p only appears because of the definition of x. This is the standard expression used in the 
literature for the coherence length, although other possibilities have also been discussed taking different 
choices for IvP [3]. 
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The data used in this analysis are restricted to small x values so that I is many times larger than 
the proton radius. For compact photon fluctuations, the maximum value of a'^^ is given by the size of 
the proton multiplied by a, giving roughly 200 //barn. However, pQCD calculations have pointed to 
the property of 'color transparency' for small dipoles [4], indicating that at large the cross section 
should behave as a'^^ a l/Q^. I.e., the proton appears almost transparent for small dipoles with the 
cross section proportional to the size of the photon. The photon state will have a maximum size which 
is expected to be set by the mass of the lightest vector meson. An effective mass will be used as a free 
parameter in the fits. All paiametrization will therefore have a basic term: 

M2 



where ctq is expected to be a typical hadronic cross section (multiplied by a). The parametrizations will 
primarily be used to discriminate the / dependence of the photon-proton cross section. 

In the next sections, we start by reviewing the data sets which have been used for the analysis. This 
is followed by a discussion of the parametrizations which have been studied, and a description of the 
fitting technique employed (Bayesian analysis based on Markov Chains). The results are then presented 
and discussed. 



2 Data Sets 

The proton DIS data used in this analysis were taken from the Durham Data Bas^E with the exception of 
the HI data from the 1999-2000 running period which was taken from the HI collaboration web page. 
For the final results, only data with x < 0.025 have been used. Data from the fixed target experiments 
NMC and E665 were used, as well as data from HI and ZEUS. BCDMS data was not included since 
only a few data points are available from this experiment in the x range of interest. Table [T] summarizes 
the main aspects of the data sets used in this analysis. 



Table 1 : Tlie data sets used in this analysis. 



Experiment 


X 


min 




0^- 
^min 


0^ 

^max 




#data 




Ref. 










(GeV2) 


(GeV2) 


X < 0.025 


X < 0.02 


X < 0.01 




E665 


8.9 


10" 


-4 


0.229 


13.4 


64 


58 


45 


[5] 


NMC 


3.5 


10- 


-3 


0.8 


7.2 


51 


39 


13 


[6] 


HI SVTX 


6.1 


10" 


-6 


0.35 


3.5 


44 


44 


44 


[7] 


ZEUS 97 


6.3 


10- 


-5 


2.7 


800 


147 


131 


116 


[8] 


HI 97 


3.2 


10" 


-5 


1.5 


120 


116 


110 


100 


[9] 


ZEUS BPT 


6.2 


10" 


-7 


0.045 


0.65 


70 


70 


70 


[10] 


HI hiQ 


3.0 


10" 


-3 


150 


1000 


26 


18 


11 


[11] 


HI 9900 


2.0 


10" 


-3 


100 


1000 


27 


19 


13 


[12] 



In the experimental analyses, F2 is extracted by first correcting the observed data for several effects, 
including QED radiative effects and the contribution from the longitudinal structure function, F^. Differ- 
ent codes and assumptions have been used for these purposes by the different collaborations. In addition, 
the QED radiative coiTcctions depend strongly on the technique used to measure the kinematic variables, 

' http://durpdg.dur.ac.uk/HEPDATA/ 
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and are usually largest at large y = Q'^/ s. The coiTection for Fl is proportional to y'^, and therefore also 
has the largest impact at large y. On the other hand, energy scale uncertainties have the biggest impact 
at small y, and give uncertainties which increase strongly at small y. These effects result in correlated 
variations of the data. To account for this, the data were multiplied in some fits by a function 

/(2/) = l + - + fey2 (3) 

y 

where a and b are fit parameters. The parameter values were assumed to be Gaussian distributed with 
mean 0. The Gaussian widths for the variations are given in Table [2l 

Table |2] also lists the normalization uncertainty on the different data sets. Since the fits described 
in the next section included an overall constant, ctq, the normalization of one of the data sets was fixed 
- the ZEUS97 data set. The normalization uncertainties ai^e therefore relative to the normalization of 
the ZEUS 97 data. For this reason, the normalization uncertainty of the ZEUS BPT data set used in 
the fits was only 1 %, since the same running period and the same luminosity measurement was used 
as for the ZEUS97 data set. The remaining normalization uncertainty accounts for the efficiency of 
the BPT system itself. The NMC data at muon beam energies of 90, 120, 200, 280 GeV as reported in 
reference [6] were combined into one file and an overall 2.5 % normalization uncertainty was assigned 
(i.e., the relative normalization uncertainty of 2 % discussed in [6] was ignored). The HI data were each 
assigned an independent normalization factor since they corresponded to different data taking periods or 
rather different experimental conditions. 

In cases where the correlated systematic uncertainties as a function of y were used, the uncertainties 
on individual data points were calculated as follows: 

ef = staff + uncorrf (4) 

where stati is the statistical uncertainty on point i and uncorri is the uncorrelated systematic uncertainty. 
The latter was calculated as 

uncorrf = max [systf — corrf, O] 

where sysU is the total systematic uncertainty reported on point i (not including the overall normalization 
uncertainty) and corri is given by 



corrf 



(7, 



2 



y 



2\2 



Fi . (5) 



In the fits where the y-dependent systematic uncertainties were not used (the majority of fits), then 
the uncertainty on each point was simply taken as the sum in quadrature of the statistical and systematic 
uncertainties: 

ef = statf + systf . (6) 

The systematic effects were then studied by changing the kinematic range over which the fits were 
performed. For all fits, the relative normalization of the different data sets was allowed to vary. 



3 Parametrizations 

As mentioned in the introduction, all parametrizations studied have the same basic dependence, and 
are of the form ^ 

They are distinguished by the form chosen for /(Z). The various forms attempted were: 
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Table 2: The systematic uncertainties used for the different data sets. Note that the normalization uncer- 
tainty of the ZEUS97 data was set to in the fits. The normalization uncertainty of the ZEUS BPT data 
was reduced to 1% since the same luminosity measurement was used as for the ZEUS97 data. 



Experiment 



Normalization 
Uncertainty (%) 



a 



E665 
NMC 
HI SVTX 
ZEUS 97 
HI 97 



1.8 
2.5 
3.0 

2.2(0) 
1.7 

2.3(1) 



0.005 
0.001 
0.0003 
0.0003 
0.0005 
0.0002 
0.0005 
0.0005 



0.15 
0.2 
0.2 
0.2 
0.2 
0.1 
0.1 
0.1 



ZEUS BPT 



HI hiQ 
HI 9900 



2.0 
1.8 



D A form inspired by the observed features of the HERA data [13]. 



fil) 



- 11 HQ') 



where A = eo + e'ln((5^ + Qo) giving the 6 free model parameters: ao, M"^ eo, e' ,Qq. In this 
parametrization there is a 'soft' energy dependence at small Q^, a oc l'^'^, with a linear increase of 
the power of / with In at high Q^. The variation of A with is a well-known feature of the HERA 
data, so it is included here exphcitly. The expected values of the parameters are: 

do is expected to be of order 100 /ibarn, since this is the order of magnitude of the measured photo- 
production cross section. 

is expected to be around ^ 0.6 GeV^, since the p is the lightest vector meson; 

^0 could be of order of the proton radius (0.9 fm), as this is the only dimensionful scale available. 
Another guide is the minimum value of / 0.1 fm; 

eo should be very close to the 'soft Pomeron' value of Donnachie and Landshoff [14], and the more 
recent update [15] which yields about 0.1; 

e' gives the rate at which A is increasing with In at high Q^. The data indicates that the value is 
around 0.05; 

Qq is presumably near 1 GeV^, since this is where the rise in the energy dependence has been ob- 
served. 

The prior probability distributions are assumed flat within large ranges around the expected values. All 
parameters are constrained to be positive. 

2P A two-Pomeron model [16] inspired parametrization: 
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giving the 6 free parameters: do, M^, /q, eoi cii In this pai^ametrization, there is a smooth transition 
from a 'soft Pomeron' with intercept 1 + eo to a 'hard Pomeron' with intercept 1 + ei. The square-root is 
not obvious, but was found to give excellent fits. The first four model parameters are as in the D model 
above, and should have similar- values in the fits. For the other parameters: 

ei a value in the range of 0.3 — 0.5 is expected. It should clearly be larger than the largest values of 
Acff which have been seen in previous fits to HERA data, but is not expected to be larger than the 
LO BFKL Pomeron [17] value of 0.5; 

gives an intermediate value where the 'soft' and 'hard' Pomerons ai^e both contributing. It is 
clearly much larger than 1 GeV^. In the fits, values up to 100 GeV^ were allowed. 



BH An extended form of the Buchmiiller-Haidt parametrization [18]: 

y(0 = ^ + ,„(| + ft),„(5) 

giving the 6 free parameters: ctq, M^, ^, Qq, Pi, xq. Note that Buchmiiller and Haidt fit the F2 data 
directly, and only considered the range > 5 GeV^. The parametrization used here is extended in that 
the parameters M^, Pi are used to give a smooth transition to the photoproduction region. The best fit 
pai^ameters found in [18] would indicate that 

A should be near the ratio a/m = 0.078/0.364 0.2 of the parameters from [18], although the 
value will be strongly correlated to the other fit parameters; 

Qq, Pi should be of order 1; 

xq was found to be 0.074 in [18], and should be near this value in this parametrization also. 

This is in no way an exhaustive list of parametrizations which have been tried out for the small-x 
data, but the parametrizations given here are intended to be a representative set to guide the discussion 
on the features of the data. Many other parametrizations have also been used, and found to give good 
results, so a good fit from one of the parametrizations discussed above does not guarantee that it contains 
the right physics motivation. 



4 Fitting Technique 

The parameter value extraction is based on the learning rule: 

P(\\D,h) ^ P{D\\h)PQ{\\Io) 

where the term on the left is the posterior probability density for the fit parameters. A, given the data, D, 
and all assumptions, /q, (including model chosen). The terms on the right are the probability density of 
the data given the model and parameters (likelihood) and the prior probability density for the parameters 
given the model chosen. The posterior is normalized at the end, yielding the famous result associated 
with Bayes theorem: 

P(A>,Io) = ^™^^ (7) 
/P(I)|A,/o)Po(A|/o)(iA 
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Normalization 



0.15 
0.1 
0.05 




1E665 



: NMC 



; H1 SVTX 



lZEUS 97 



HI 97 



_ ZEUS BPT, 



HI Hi 



-H1 99-00 



0.8 0.9 1 1.1 1.2 

Factor 



Figure 2: Probability density distribution for the normalizations of the different data sets used in the 
fitting. These normalizations are for fit 22. 
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0.8 




0.0575 



Figure 3: Probability density distribution and contours for P{lo, £'\D, Iq) for fit 22. 



The prior probabilities are chosen to be flat for all model parameters. As described above, the nor- 
malizations for each data set, except ZEUS97, are left free and the priors are Gaussians with widths 
specified in section [2l The likelihood of the data is evaluated assuming a Gaussian probability distri- 
bution for fluctuations from the model expectation, with the Gaussian variance given either by Eq. |4] 
or Eq. [6] For each data point, the model prediction, F^"^'^'^, is compared to the data point, F2^i, scaled 
by the normalization constant, S'cxpt, and, in some fits, the y dependent factor to account for correlated 
uncertainties. The log of the probability density for one data point is calculated as 

lnP(F,,|A,/o) =ln^ -0.5 ( - F2,J{y)S..,X 

The log of the total probability density for the data is given by the sum of the contributions from the 
individual data points. This is then added to the log of the probability density for the prior to give the 
(unnormalized) value of In P{X\D, Iq). 

The total number of fit parameters ranged from a minimum of 13 for the model parameters and free 
normalizations, to 27, if all parameters related to the data sets were also allowed to vary. The parameters 
related to the data sets are so-called nuisance parameters, and can be integrated out if needed: 

^'(Amodcl|^,/o) = y"p(A|Z),/o)dAdata (8) 

where A = {Amodci, Adata} 
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4.1 Markov Chain Monte Carlo 



The posterior probability, Eq. |2l is determined using a Markov Chain Monte Carlo (MCMC) [19]. For 
each parameter, the allowed range, starting value, and proposal function are set. The proposal function 
for varying the pai^ameters was always chosen flat. The MCMC implementation automatically changed 
the sampling range of the proposal function after an initial number of iterations, in order to have the 
efficiency for keeping the proposed parameter value in the range 10-50%. The MCMC was run until the 
mean and rms of P{X\D, Iq) over sufficiently long periods was stable. Once convergence was achieved, 
the MCMC was run to save the distribution of P{X\D, Iq) (individual instances of A were saved). This 
gave access to the full probability density distribution. Any function of the parameters can then be 
evaluated, as well as the probability density for this function, without approximations. The integral in 
Eq. |7]is not solved for explicitely. Rather, the MCMC guarantees sampling of P{X\D, Iq) according to 
the correct probability density, so that the normalization is given by the total number of events in the 
chain (after convergence). 

As an example, the normalizations for the different data sets for fit 22 (see Table [3]l are shown in 
Fig.|2] The results for the normalizations from the different fits are given in the next section. 

The correlations amongst the fit parameters are available via the posterior probability density. As an 
example, the probability density contours for the marginalized density 

P(/o,e'|D,/o) = j Piao,M'^,lo,eo,e',Ql\d^t^\D,Io)d(JodM^deodQldXd^t^ 
for the fit 22 are shown in Fig. [3l 



4.2 Goodness-of-fit 

The goodness-of-fit was evaluated using a 'p-value' defined as follows: 

p = ^ ' (9) 

/ P*{x)dXdi,t!,dx 

where 

P*(x) = P(:r|AJnodel>-^data,^o)-Po(Adata|-fo) 
P^ = ^(^l-^modcb -^data' ^o)-Po(Adatal^o) • 

Here x is a possible realization of the data, D is the observed data, and A^^^^i, A^^^^^^ represent the values 
of the model and data parameters which maximize P{X\D, Iq). Note that only the model parameters are 
fixed at the values giving the mode for P*{x), while the nuisance parameters are allowed to vary accord- 
ing to their priors. This p-value is a tail area probability, and is just the probability that an experiment 
would observe a smaller (likelihood data prior) than the one found, assuming the model is conect. If 
the likelihood accurately describes nature and the experimental effects, then the p-value distribution for 
many experiments would be flat between (0, 1). Small values therefore suggest that the model is unlikely 
to be a good representation of nature (assuming the modelling of the experimental conditions was done 
correctly), since this says that few experiments are expected to have such small values of P^. In practice, 
the p-value was calculated using 10000 simulated experiments where the normaUzations were allowed 
to vary according to the probability density in the priors, and the data points were fluctuated around the 
expectation value with a probability density given by a Gaussian with appropriate width. The fraction of 
simulated experiments satisfying the condition given on the integral in Eq.|9]was then calculated. 
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4.3 Bin-centering and further analysis 



The data from the different experiments are reported in some cases at fixed values of x and varying Q^, 
in other cases at fixed and varying x, or at fixed and varying y. This is no problem for the model 
fitting procedure, but it does make data presentation cumbersome. The data were therefore 'bin-centered' 
by moving data to fixed values of using the parametrization under study, as follows: 

F2{Qlx) = ^^^^F2(Q2,x)/(y)5expt - (10) 

where f{y) is defined in Eq. [3]and S'expt is the normalization factor for the experiment in question. The 
following Ql values have been used: 0.15, 0.4, 0.8, 1.5, 3, 5, 7.5, 10, 15, 20, 30, 40, 50, 70, 100, 150, 
200, 250, 300, 400, 500, 600, 750, 1000 GeV^. 

The MCMC was then rerun for the bin-centered data at each value using the simple parametriza- 
tion: 

a^P{l)=afl^^« (11) 
and the probability density P((Tq^, Xcs\D, Iq) was evaluated separately in each bin. 



5 Results 

Fits were performed using the parametrizations described in Section [3] The kinematic range over which 
the fits were performed was also varied. The results were stable for x < 0.025, but clearly deteriorated 
if the upper limit for x was increased to x < 0.05. The best fits were achieved with the D and 2P 
parametrizations, while the BH parametrization was less successful at describing the data. 

Table 3: Summary of the fit results for different parametrizations. The first column gives the fit number, 
which is used as a label in subsequent tables and plots. In cases where no value of Xmin is specified, all 
data with x < Xmax were used. If no value of y max is given, then data up to the largest available y were 
used. The p-value, indicating the goodness-of-fit, is given in the last column. There is no p-value for fit 
26 since in this case no global fit was performed (only the extraction of Xcs). 



Fit 


Parametrization 


Xmin Xmax Umax 


P 


27 


D 


0.01 


0.89 


22 




0.02 


0.79 


24 




0.025 


0.22 


15 


(Ql = M^) 


0.02 


0.08 


31 




0.01 0.3 


0.95 


28 


2P 


0.01 


1.0 


21 




0.02 


0.99 


25 




0.025 


0.91 


26 




0.002 0.025 




29 


BH 


0.02 


0.03 



A summary of the fit results is given in Tabled while the fitted parameter values for some reference 
fits are given in Table |4]|6l Each fit is assigned a number which will be used in the remainder of the 
document as a reference number. 
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E665 
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Figure 4: The E665 F2 data with the D parametrization superposed (fit 22) . 
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Figure 5: T/ie NMC F2 data with the D parametrization superposed (fit 22). In bins where only one data 
point is available, the prediction is shown with a square. 
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Figure 6: The HI SVTX F2 data with the D parametrization superposed (fit 22). In bins where only one 
data point is available, the prediction is shown with a square. 
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Figure 7: T/ie ZEUS 97 F2 data with the D parametrization superposed {fit 22). In bins where only one 
data point is available, the prediction is shown with a square. 
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Figure 8: T/ze //7 97 F2 (ia;?<3 w/^/i D parametrization superposed (fit 22). 
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Figure 9: T/ie ZEUS BPT F2 data with the D parametrization superposed (fit 22). In bins where only one 
data point is available, the prediction is shown with a square. 
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Figure 10: T/ze HI hiQ F2 data with the D parametrization superposed (fit 22). In bins where only one 
data point is available, the prediction is shown with a square. 
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Figure 11: The HI 9900 F2 data with the D parametrization superposed (fit 22). In bins where only one 
data point is available, the prediction is shown with a square. 
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The p-values cleai^ly distinguish between the D and 2P models on the one hand, and the BH and 
other models (not shown) which have been fitted to the data. The p-values are very close to 1 for the 
D and 2P fits, indicating that adding the statistical and systematic errors in quadrature as done in Eq. [6] 
overestimates the uncertainties. On the other hand, handling the y-dependent systematic effects as in 
Eq. |4] resulted in small probabilities for the fit results (typically 0). However, the fitted parameter values 
changed only slightly. No results are quoted for the fits where an attempt has been made to subtract 
the correlated systematics uncertainties. The fitted data are shown in Figs |4]- [TT] together with the D 
pai^ametrization (fit 22) for the case x < 0.02 to give a sense of the quality of the fits. 

Several quantities are calculated from the Markov Chain output and reported in the tables. The 
definitions are as follows: 



mode of Aj The value of Aj which maximizes the marginalized posterior probability density 

m'ax{P(A,|L',Io) = j P{\\D,h)d\j^i} 

Mean of Aj The expectation value 

< A, >= j P{\i\D,h)\id\i 

Median of Aj The value of Aj such that 50 % of the probabilty is below this value 

/ P(Ai|L),Io)dAi = 0.5 
where Xmin is the minimum possible value for parameter A^. The desired value is Xmed- 



Central Interval The (1 — 2a) central interval is defined such that a fraction a of the probability is 
contained on either side of the interval 

a= P{Xi\D,Io)dXi= / P{Xi\D,Io)dXi 

where the desired interval is [Xiower, Xupper]- The minimum and maximum values of the parameter are 

Xmim Xraax- 



rms The root-mean-square is defined as usual 



rms 



j P{X,\D,Io)XjdXi -{^j P{Xi\D,Io 



)XidXi 



The parameter estimates were found to be near the expected values, and no parameter was at the 
limit of the allowed range. The parameter values for a given parametrization did not show strong vari- 
ations when changing the kinematic range for the fits. Larger variations were seen from changing the 
parametrization from one form to another. As is cleaiiy seen in the tables, the fitted normalization values 



18 



Table 4: Parameter results for the D parametrization for fit 27. 



Fit 


Paiameter 


Mode 


Mean 


Median 


68 % central range 


rms 


27 


do (mbarn) 


0.0627 


0.0641 


0.0635 


0.0616 


- 0.0669 


0.0026 


27 


Af 2 (GeV2) 


0.635 


0.630 


0.632 


0.591 


- 0.665 


0.034 


27 


lo (fm) 


1.22 


1.19 


1.19 


1.06 


- 1.30 


0.118 


27 


eo 


0.0671 


0.0674 


0.0671 


0.0613 


- 0.0737 


0.0062 


27 


e' 


0.0636 


0.0637 


0.0626 


0.0606 


- 0.0646 


0.0020 


27 


Ql GeV2 


1.54 


1.44 


1.39 


1.19 


- 1.54 


0.17 


27 


E665 norm 


1.017 


1.016 


1.015 


1.004 


- 1.027 


0.011 


27 


NMC norm 


1.042 


1.044 


1.044 


1.031 


- 1.056 


0.014 


27 


HI SVTX norm 


0.933 


0.934 


0.933 


0.921 


- 0.945 


0.012 


27 


H197 norm 


1.030 


1.030 


1.030 


1.026 


- 1.033 


0.004 


27 


ZEUS BPT norm 


0.985 


0.983 


0.983 


0.974 


- 0.991 


0.008 


27 


HI hiQ norm 


1.009 


1.007 


1.007 


0.994 


- 1.019 


0.013 


27 


HI 9900 norm 


1.015 


1.017 


1.017 


1.008 


- 1.025 


0.008 



Table 5: Parameter results for the 2P parametrization for fit 28. 



Fit 


Parameter 


Mode 


Mean 


Median 


68 % central range 


rms 


28 


(To (mbarn) 


0.0583 


0.0579 


0.0577 


0.0555 


- 0.0601 


0.0024 


28 


Af 2 (GeV2) 


0.581 


0.576 


0.579 


0.539 


- 0.605 


0.032 


28 


lo (fm) 


0.538 


0.505 


0.507 


0.428 


- 0.577 


0.075 


28 


eo 


0.0855 


0.0836 


0.0837 


0.080 


- 0.087 


0.0033 


28 


ei 


0.370 


0.366 


0.366 


0.358 


- 0.373 


0.008 


28 


A2 GeV2 


31.6 


31.6 


31.5 


28.8 


- 34.3 


2.7 


28 


E665 norm 


1.000 


1.004 


1.004 


0.993 


- 1.015 


0.011 


28 


NMC norm 


1.027 


1.025 


1.025 


1.012 


- 1.037 


0.013 


28 


HI SVTX norm 


0.936 


0.935 


0.935 


0.923 


- 0.947 


0.012 


28 


HI 97 norm 


1.033 


1.035 


1.034 


1.031 


- 1.038 


0.004 


28 


ZEUS BPT norm 


1.000 


1.000 


1.000 


0.991 


- 1.009 


0.009 


28 


HI hiQ norm 


0.965 


0.964 


0.964 


0.951 


- 0.977 


0.013 


28 


HI 9900 norm 


0.970 


0.968 


0.967 


0.958 


- 0.976 


0.009 
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Table 6: Parameter results for the BH parametrization for fit 29. 



Fit 


Parameter 


Mode 


Mean 


Median 


68 % central range 


rms 


29 


ctq (mbarn) 


0.0320 


0.0330 


0.0326 


0.0298 - 


0.0367 


0.0032 


29 


(GeV^) 


0.273 


0.275 


0.271 


0.241 - 


0.305 


0.029 


29 


Qo (GeV2) 


0.930 


0.938 


0.934 


0.871 - 


1.001 


0.065 


29 


PI 


1.35 


1.38 


1.37 


1.31 - 


1.44 


0.06 


29 




0.0442 


0.0448 


0.0446 


0.0421 — 


0.0470 


0.0025 


29 




2. by 


2.d8 


Z.D 1 


2.58 - 


2.77 


O.IU 


29 


E665 norm 


0.975 


0.975 


0.975 


0.964 - 


0.987 


0.012 


29 


NMC norm 


0.953 


0.953 


0.953 


0.940 - 


0.967 


0.014 


29 


HI SVTXnorm 


0.921 


0.921 


0.921 


0.909 - 


0.933 


0.013 


29 


HI 97 norm 


1.022 


1.027 


1.027 


1.023 - 


1.030 


0.004 


29 


ZEUS BPT norm 


0.997 


0.995 


0.995 


0.987 - 


1.004 


0.008 


29 


HI hiQ norm 


0.998 


1.000 


1.000 


0.988 - 


1.012 


0.013 


29 


HI 9900 norm 


1.006 


1.005 


1.005 


0.997 - 


1.013 


0.008 



for the different data sets depends strongly on the parametrization chosen, particularly for the data at 
the lowest and highest Q^. These variations ai^e larger than the fit uncertainty on the normalization in 
individual fits. The only data set which consistently requires a significant normalization correction is the 
HI SVTX data, where a normalization factor of typically 0.93 is preferred by the fits. 

The mode, mean and median of the fit values are very close, indicating that the posterior probability 
distributions for these parameters is symmetric. Also, the 68 % central range is very nearly twice the rms 
value, indicating that the distributions are Gaussian in shape. 

The three parametrizations (fits 27, 28,29) ar^e compared to bin-centered F2 data in Figs. [T2lfT6l 
Fit 27 was used to bin center the data as explained in section 14.31 As can be seen from these plots, all 
parametrizations follow the general trend of the data. However, the BH parametrization has a too-shallow 
X dependence at low while the D parametrization has a too-steep x-dependence at the highest Q^. 



5.1 Fits of parametrization D 

The value of do is the expected photoproduction cross section for / = Iq and = 0. The value at larger 
I is given by 

„ n /; nO.095 

The exponent is within errors identical to the one found in [15] for hadron-hadron total cross sections. 
In principle, this could be compared to measured photoproduction cross sections. However, this would 
require knowledge of / in photoproduction, whereas what is known is W'^ = Q^/x. This comparison 
is therefore difficult. However, it is possible to take the measured photoproduction cross section at 
HERA [20,21] of approximately 170 fibain to deduce / « 5 • 10"^ fm; i.e., x Pi 2 ■ 10~^, similar to the x 
range of the ZEUS BPT data. However, the photoproduction data are at smaller and therefore should 
correspond to larger /, indicating possible problems with the parametrization as — > 0. 
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Proton Structure Function 




-2 



X 



Figure 12: Comparison of parametrizations to bin-centered F2 data for x < 0.01. The values near the 
top of each box give the value in GeV^ to which the data have been moved. The symbols for the 
different data sets are given in the lower right plot, while the different parametrizations are defined in 
the upper left plot. 
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Proton Structure Function 
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Figure 13: See caption ofFig.\T2\ 
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Proton Structure Function 
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Figure 14: See caption ofFig.\T2\ 
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Figure 15: See caption ofFig.\T2\ 
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Figure 16: See caption ofFig.\T2\ 
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The value of is within uncertainties the same as M^, and the value of Iq is very close to the 
proton radius. The parametrization D has an interesting property, namely, that if M = Qq, then the cross 
section will go to a fixed point atl = Iq exp 1/e' independently of the starting Q^. In fact, 

requires 



+ ■ 

For Q2 ^ this gives for fit 27 / f« 8 • 10^ fm, while for = the value is / = 5 • 10^° fm. 
A possible interpretation is that the data are headed for a fixed point, but the behavior softens once the 
evolved photon state approaches hadronic dimensions. 

The a^'^ cross section data is shown versus I in Fig. [T7] The data used here aie bin centered using 
the values from fit 22, whereas the straight lines are from the fits to cr = o"o^^'=** for each value. A 
blow up of the large I region, including the error bands from the fits, is shown in Fig. [18] for a subset of 

values. The extrapolated a^'^ cross near I = 10^ fm. At larger values of Q^, the crossing point tends 
to move to lower I, but the uncertainties aie larger. 



5.2 Fits of parametrization 2P 

Again here, the 'soft-Pomeron' value found from hadron-hadron scattering is recovered within the un- 
certainties. For = 0, we find 



a = ao{l/lo) 



0.086 



The value of the 'haid-Pomeron' intercept is found to be 1 + ei f» 1.37, with equal contribution from 
the 'soft' and 'hard' Pomerons around = f« 30 GeV^. In this parametrization, there is no unique 
crossing point, and the dependence of a'^^ on I becomes universal at high Q^. 

Comparing to the measured photoproduction cross section at HERA as done above, we find a value 
of / f« 1.4 • 10^ fm for the HERA photoproduction data, which is somewhat larger than the result of the D 
parametrization. A effective mean value of / 1 • 10^ fm is allowed within the parameter fit uncertainties 
and the uncertainties of the measured photoproduction cross section, which would correspond to the 
more reasonable effective mean x f« 10~^. This again indicates that the 2P parametrization is a better 
representation of the data than the D parametrization. 



5.3 Fits of the parametrization BH 

As mentioned above, the fits with this extended version of the Buchmiiller-Haidt parametrization does 
not work as well as the D or 2P fits. The difficulty in the fitting is primarily at lower Q^. 

The value of 1/^ in these fits is considerably lower than in the D or 2P fits. The reason is that a 
part of the softening of the behavior is already accounted for in the logarithm. The value of Qq is 
somewhat larger than the value found by Buchmiiller and Haidt (Qq = 0.5 GeV^), while the value of xq 
is somewhat lower (0.044 versus 0.074 in [18]). 
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5.4 Acff as a function of 



One of the most striking results from the early HERA data was the dependence of the steepness of the 
rise of F2 at small-x on [13]. At larger Q^, a Ag// tx InQ^ dependence was found, while the data 
indicated a flattening at smaller Q^. We use the data sets and fits described above to study the behavior 
in more detail. 

Table 7: Comparison of X^fi values from the data using different parametrizations for the bin centering 
and different x ranges. The different fits are described in Tabled 
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The quantity 

was calculated for each parametrization. In terms of the parameters of the fits, we have 

D: Aeff = eo + e'ln(Q2 + Q2)^ 



2P: AcflF = eo + (ei - eo)y qt^, 
RTT. ^ _ ln(QVQ§ + i) 

^''^ yl+ln(Q2/Qg+l)ln (l/lo)' 

The values of X^s for the different parametrizations are plotted versus Q2 in Fig. [H and a more 
complete set of results is given in Table |7] The value from the BH parametrization is I dependent and 
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therefore not given in the table. Two curves are plotted in Fig. [19] for the BH paiametrization, one for 
= 20000 GeV^ and one for = 70000 GeV^. These values represent the typical range from 
HERA data. The data in the plots used the bin centering from fit 27. 

The D and 2P parametrizations follow the / dependence of the data equally well up to Q"^ values of 
about 70 GeV^. Beyond this value, the 2P parametrization is closer to the data. Note that the mean value 
of I for the existing data is decreasing as increases. It was checked whether the turnover of Aeg at 
higher is due to this by limiting the fitted values to x > 0.002. The results are presented in Table |7] 
(fit 26) and shown in Fig. |20l A turnover is still seen, although the uncertainties are of course much 
larger. One point, at Q'^ = 1.5 GeV^, shows a large discrepancy when the data is limited to the range 
0.002 < X < 0.025 (fit 26). The reason is that with this cut, no HERA data ai-e available at this Q^. The 
fitted uncertainty on the value of X^g is small because of the small uncertainties of the fixed target data. 

It is difficult to conclude at this point how seriously to take the observed turnover of Acfr at the higher 
values. While the effect seems to be real, it could still be explained by a combination of Umited data 
and a changing / range as changes. Note that only HERAI data have been used in this analysis. Both 
HI and ZEUS will report new high y cross sections in the near future, which should greatly improve the 
measurement of Aegf in this region, thus allowing for a better analysis in this kinematic region. 



6 Summary 



Parametrizations of a'^^ have been fitted to small-x fixed target and HERA data using a Bayesian analysis 
based on a Markov Chain Monte Carlo. Results for three different parametrizations: 



D: a-'P 

2P : a'y^ 
BH : cj'^-P 



M2 



(^0 



^ \ eo+e'ln(Q2+Q2) 



Q2 + M2 V^o, 



M2 



Q2 + M2 



X 



are given in this paper. The best fits are from the 2P parametrization (inspired by the two-Pomeron 
model [16]). This conclusion goes hand-in-hand with the observation of a flattening of the effective 
growth of the cross section at large coherence length, This flattening is intruiging, and should be 
verified with future HERA data. The D parametrization, inspired by the known properties of the HERA 
data, does not allow for this turn-over. It also gives good fits overall, but clearly does not reproduce 
the observed X^g values at high Q^. The Buchmiiller-Haidt [18] inspired parametrization, BH, does not 
follow the general trends of the data at the lowest and highest Q^. 

A possibly interesting observation is the vanishing of the variation of a^'^ when extrapolated 
to particular very large values of /. This happens automatically in the D type parametrization at I = 
/oexp 1/e' if Qq = Af2. This behavior indicates that a"^^ will have a roughly independent value 
for I > 10^ fm. An interpretation would be that the photon state has evolved sufficiently after this 
coherence length that it has lost memory of its initial configuration. This speculation is based on extreme 
extrapolations, and more data would of course be needed to check this behavior. 

Many different parametrizations, not reported in this paper, have been attempted to fit the small-x 
data, including some which gave very good fits. They look very different in form, so that it is impossible 
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Photon-Proton Cross Section 




Photon-Proton Cross Section 
1 I 




Figure IS: An expanded view of the large I region for the same conditions as those in Fig. \17\ The 68 % 
central interval from the fits are shown as bands. 
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Figure 19: The values of \cs as defined in the text as afiinction of using centering from fit 27. The 
dashed-dotted curve is for the BH parametrization (fit 29) and = 20000 GeV^, while the dashed 
curve is for the BH parametrization with W"^ = 70000 GeV^. The solid curve is the D parametrization 
(fit 27), while the dotted curve is the 2P parametrization (fit 28). The band represents the range of e from 
fits to hadron-hadron scattering data [15]. 
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Figure 20: Comparison of X^ff data values for different kinematic ranges and different parametrizations 
as defined in Table 13 The data points have been slightly offset along the axis so as not to have 
overlapping uncertainty bands. 
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to draw strong physics conclusions from the success of any one fit. The possible turn-over of X^s at high 
would provide a distinguishing feature which would rule out many possible forms and hopefully 

point to the correct physics. Further, higher precision data from HERAII is expected on this front and 

could help clarify the situation. More extensive and more precise data in the transition region at low 
would also help in weeding out incorrect approaches. Here, a proposed EIC [22] would make a big 

difference. For the extrapolations to very large I, obviously the LHeC [23] would be the best tool. 
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